AITopics | lorenz curve

Collaborating Authors

lorenz curve

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Two-sided fairness in rankings via Lorenz dominance

Neural Information Processing SystemsApr-25-2026, 17:31:33 GMT

We consider the problem of generating rankings that are fair towards both users and item producers in recommender systems. We address both usual recommendation (e.g., of music or movies) and reciprocal recommendation (e.g., dating). Following concepts of distributive justice in welfare economics, our notion of fairness aims at increasing the utility of the worse-off individuals, which we formalize using the criterion of Lorenz efficiency. It guarantees that rankings are Pareto efficient, and that they maximally redistribute utility from better-off to worse-off, at a given level of overall utility. We propose to generate rankings by maximizing concave welfare functions, and develop an efficient inference procedure based on the Frank-Wolfe algorithm. We prove that unlike existing approaches based on fairness constraints, our approach always produces fair rankings. Our experiments also show that it increases the utility of the worse-off at lower costs in terms of overall utility.

artificial intelligence, fairness, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Distribution-Free Statistical Dispersion Control for Societal Applications

Neural Information Processing SystemsFeb-15-2026, 12:18:11 GMT

Previous work has focused mainly on bounding either the expected loss of a predictor or the probability that an individual prediction will incur a loss value in a specified range.

artificial intelligence, machine learning, probability, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

48259990138bc03361556fb3f94c5d45-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 11:47:03 GMT

exposure, fairness, recommendation, (15 more...)

Neural Information Processing Systems

Country:

South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
North America > United States (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.68)

Industry:

Media (0.45)
Leisure & Entertainment (0.45)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Information Management (0.67)

Add feedback

Gini Score under Ties and Case Weights

Brauer, Alexej, Wüthrich, Mario V.

arXiv.org Machine LearningNov-20-2025

The Gini score is a popular statistical tool in model validation. The Gini score has originally been introduced and used for binary responses Y {0, 1}, and there are many equivalent formulations of the (binary) Gini score such as the receiver operating curve (ROC) and the area under the curve (AUC); see, e.g., [Bamber (1975)], [Hanley-McNeil (1982)] and [Fawcett (2006)]. These different formulations are also equivalent to the Wilcoxon-Mann-Whitney's U statistic, see [Hanley-McNeil (1982)], [DeLong et al. (1988)], [Byrne (2016)], and to [Somers (1962)]'s D, see [Newson (2002)]. Thus, there are at least five equivalent formulations of the Gini score in a binary context, and there is a broad literature on its behavior which is well understood. When it comes to general real-valued responses, things become more difficult, and definitions and results on the Gini score are mainly found in the credit risk and actuarial literature. In this stream of literature, the Gini score has been introduced by [Gourieroux-Jasiak (2007)], [Frees et al. (2011), Frees et al. (2013)]. Furthermore, in the real-valued setting the Gini score is studied in much detail in [Denuit et al. (2019)] and [Denuit-Trufin (2021)]. The Gini score is a statistic that assesses whether a given risk ranking is correct.

artificial intelligence, gini score, machine learning, (16 more...)

arXiv.org Machine Learning

2511.15446

Country: Europe (0.28)

Genre: Research Report (0.64)

Industry: Banking & Finance > Insurance (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Distribution-Free Statistical Dispersion Control for Societal Applications

Neural Information Processing SystemsOct-8-2025, 23:34:25 GMT

Previous work has focused mainly on bounding either the expected loss of a predictor or the probability that an individual prediction will incur a loss value in a specified range.

artificial intelligence, machine learning, probability, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Generative Bayesian Computation for Maximum Expected Utility

Polson, Nick, Ruggeri, Fabrizio, Sokolov, Vadim

arXiv.org Machine LearningAug-28-2024

Generative Bayesian Computation (GBC) methods are developed to provide an efficient computational solution for maximum expected utility (MEU). We propose a density-free generative method based on quantiles that naturally calculates expected utility as a marginal of quantiles. Our approach uses a deep quantile neural estimator to directly estimate distributional utilities. Generative methods assume only the ability to simulate from the model and parameters and as such are likelihood-free. A large training dataset is generated from parameters and output together with a base distribution. Our method a number of computational advantages primarily being density-free with an efficient estimator of expected utility. A link with the dual theory of expected utility and risk taking is also discussed. To illustrate our methodology, we solve an optimal portfolio allocation problem with Bayesian learning and a power utility (a.k.a. fractional Kelly criterion). Finally, we conclude with directions for future research.

generative method, learning, neural network, (13 more...)

arXiv.org Machine Learning

2408.16101

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Spain > Galicia > Madrid (0.04)
(2 more...)

Genre: Research Report (0.40)

Industry: Energy (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning

Springer, Jacob Mitchell, Nagarajan, Vaishnavh, Raghunathan, Aditi

arXiv.org Artificial IntelligenceMay-30-2024

Sharpness-Aware Minimization (SAM) has emerged as a promising alternative optimizer to stochastic gradient descent (SGD). The originally-proposed motivation behind SAM was to bias neural networks towards flatter minima that are believed to generalize better. However, recent studies have shown conflicting evidence on the relationship between flatness and generalization, suggesting that flatness does fully explain SAM's success. Sidestepping this debate, we identify an orthogonal effect of SAM that is beneficial out-of-distribution: we argue that SAM implicitly balances the quality of diverse features. SAM achieves this effect by adaptively suppressing well-learned features which gives remaining features opportunity to be learned. We show that this mechanism is beneficial in datasets that contain redundant or spurious features where SGD falls for the simplicity bias and would not otherwise learn all available features. Our insights are supported by experiments on real data: we demonstrate that SAM improves the quality of features in datasets containing redundant or spurious features, including CelebA, Waterbirds, CIFAR-MNIST, and DomainBed.

arxiv preprint arxiv, dataset, hard feature, (15 more...)

arXiv.org Artificial Intelligence

2405.20439

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Distribution-Free Statistical Dispersion Control for Societal Applications

Deng, Zhun, Zollo, Thomas P., Snell, Jake C., Pitassi, Toniann, Zemel, Richard

arXiv.org Machine LearningSep-24-2023

Explicit finite-sample statistical guarantees on model performance are an important ingredient in responsible machine learning. Previous work has focused mainly on bounding either the expected loss of a predictor or the probability that an individual prediction will incur a loss value in a specified range. However, for many high-stakes applications, it is crucial to understand and control the dispersion of a loss distribution, or the extent to which different members of a population experience unequal effects of algorithmic decisions. We initiate the study of distribution-free control of statistical dispersion measures with societal implications and propose a simple yet flexible framework that allows us to handle a much richer class of statistical functionals beyond previous work. Our methods are verified through experiments in toxic comment detection, medical imaging, and film recommendation.

artificial intelligence, machine learning, probability, (17 more...)

arXiv.org Machine Learning

2309.13786

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > New York (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Baselines for Identifying Watermarked Large Language Models

Tang, Leonard, Uberti, Gavin, Shlomi, Tom

arXiv.org Artificial IntelligenceMay-29-2023

Generated Text Detection Via Statistical Discrepancies Recent methods such as DetectGPT and GPTZero distinguish We consider the emerging problem of identifying between machine-generated and human-written text the presence and use of watermarking schemes by analyzing their statistical discrepancies (Tian, 2023; in widely used, publicly hosted, closed source Mitchell et al., 2023). DetectGPT compares the log probability large language models (LLMs). We introduce a computed by a model on unperturbed text and perturbed suite of baseline algorithms for identifying watermarks variations, leveraging the observation that text sampled from in LLMs that rely on analyzing distributions a LLM generally occupy negative curvature regions of the of output tokens and logits generated by model's log probability function. GPTZero instead uses watermarked and unmarked LLMs. Notably, watermarked perplexity and burstiness to distinguish human from machine LLMs tend to produce distributions text, with lower perplexity and burstiness indicating that diverge qualitatively and identifiably from a greater likelihood of machine-generated text.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2305.18456

Genre: Research Report (0.65)

Industry: Information Technology > Security & Privacy (0.51)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback